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PURIFICATION, CLONING AND BIOCHEMICAL CHARACTERIZATION 
OF XendoU. ENDORIBONUCLEASIC ACTIVITY INVOLVED IN SMALL 
NUCLEAR RNA SPLICING-INDEPENDENT BIOSYNTHESIS IN 
XENOPUS LAEVIS 

5 

Endoribonucleases play essential role in RNA metabolism 
participating both in "degradative" pathways, such as mRNA decay 
(Schoenberg and Chemokaiskaya, 1997). and in "maturative" pathways, to 
generate functional RNA molecules from primary transcripts. Only a few 

10 endoribonucleases have been isolated in eukaryotes, most of them being 
involved in biosynthesis of translation apparatus components. Among 
these, there are ribonucleoprotein enzymes, such as RNase P and RNase 
MRP, which act as site-specific endoribonucleases: RNase P is involved in 
generation of the 5' end of tRNAs (Nashimoto, 1995), whereas RNase 

15 MRP is implicated in processing of pre-rRNA (Lygerou et al., 1996). Other 
endonucleolytic activities, as 3'-tRNase, tRNA splicing endonuclease and 
members of RNase III family (Trotta et al., 1997; Bujnicki and Rychlewski, 
2000; Zamore, 2001) are protein enzymes. 3'-tRNase is an eukaryotic 
spermidine-dependent endoribonuclease which precisely removes the 3'- 

20 end tail from tRNA precursors (Castano et al., 1985; Nashimoto, 1995). 
tRNAs splicing endoribonucleases are required for the intron removal from 
pre-tRNAs: it is an Mg^^'-dependent enzyme and cleaves pre-tRNAs at 5' 
and 3' splice sites, releasing products having 2-3' cyclic phosphate and 
5'OH ends (Peebles et al., 1983). RNases III are endoribonucleases acting 

25 on double-strand RNA found in bacteria and eukaryotes: they were first 
isolated in E. coli (Court, 1993; Nicholson, 1997) and subsequently 
eukaryotic orthologs were identified on the basis of sequence similarity. S. 
cerevisiae RNase III (Rntip), was shown to be involved in several 
biosynthetic events such as pre-rRNAs, snRNAs and snoRNAs processing 

30 (Elela et al., 1996; Kufel et al., 1999; Chanfreau et al., 1997; Allmang et 
al., 1999; Chanfreau et al., 1998). Recently it was shown that Rntip is 
also involved in the release of the intron-encoded snoRNAs, U18 and 
snR38 from their pre-mRNAs (Giorgi et al., 2001). Furthermore, a new 
member of eukaryotic RNAses III family, named "Dicer", has been 

35 identified in metazoa (Bemstein et al., 2001), It is known to be involved in 
interference (RNAi) pathway, generating 21-23 nt small interfering RNAs 
(siRNAs) from longer partially double stranded precursors. These 
processing products act in RNA-mediated gene regulation (Ambros, 



2001). Cleavage by Rnase III releases 3'OH and 5' phosphate ends and it 
is Mg^^-dependent. 

The authors of the invention previously demonstrated that an 
endoribonucleolytic activity plays a key role in the biosynthesis of the box 
5 C/D U16 snoRNA, encoded in the third intron of the L4 ribosomal protein 
gene in X. laevis (Caffarelli et al., 1994). The authors already showed that 
U16 processing from the host intron is alternative to the splicing reaction: 
thereby, synthesis of the L4 mRNA is alternative to the production of U16 
snoRNA (Caffarelli et al., 1996). In this context the biosynthetic 

10 mechanism of U16, per se, regulates the expression of L4 gene at the 
post-transcriptional level. 

The authors now purified to homogeneity from X, laevis oocyte 
nuclear extracts (ONE) and characterized the endonucleolytic activity 
(XendoU, GenBank TM/EBI Data Bank AJ507315) responsible for the 

15 processing of U16 snoRNA from its host intron. Partial protein sequencing 
allowed to clone XendoU cDNA, to express it and to study features of the 
enzyme. This protein represents a novel endoribonucleasic activity, being: 
i) poly-U specific, ii) single filament specific, iii) Mn'*^^-dependent and iv) 
able to release cleavage products with 5' OH and 2-3' cyclic phosphate 

20 ends. 

It is therefore an object of the present invention a nucleic acid 
encoding for a protein with endoribonucleasic activity which is poliU 
sequences specific, Mn^^-dependent and releases as cleavage products 
molecules having 2-3' cyclic phosphate and 5'OH ends. Preferably nucleic 
25 acid includes substantially nucleotide sequence of figure 4, functional 
homologs thereof or complementary sequence thereto. 

It is a further object of the invention a recombinant vector able 
to express effectively the inventive nucleic acid in prokaryotes. 

It is a further object of the invention a recombinant vector able 
30 to express effectively the inventive nucleic acid in eukaryotes 

Those skilled in the art will be able to recognize the most 
suitable vectors also considering the host organism in order to express the 
inventive nucleic acid. 

It is a further object of the invention a protein with 
35 endoribonucleasic activity which is poly-U sequences specific, Mn"^*- 
dependent and releases RNA molecules bearing 2', 3' cyclic phosphate 
and 5'OH ends as cleavage products, or functional portions thereof. 



Preferably the protein is encoded by the nucleic acid of the invention, 
more preferably protein substantially has amino acid sequence of figure 4. 
Advantageously the protein is produced by synthetic or recombinant route 
using methods known by those skilled in the art. 
5 The invention will be now described without any limitation 

thereof referring to experimental procedures wherein reference will be 
made to the following figures: 

Figure 1 shows activity assay and purification of XendoU. A. 
schematic representation of U16 snoRNA processing. P indicates U16- 

10 containing precursor (003 RNA); 1-1 a and 1-1 b represent the products 
generated by cleavage upstream from U16 while l-2a and l-2b represent 
their 3' complementary molecules thereto; 1-3 depicts the product 
originated from cleavage downstream from U16 while 1-4 represent its 3* 
respective complementary molecule; pre-U16 represents U16 precursor 

15 with additional 5' and 3* flanking sequences. Cap structure is shown as a 
black dot, exons as boxes, the intron as a continuous line and U16 
snoRNA coding region as a thicker line. Large arrows localise the major 
sites of cleavage and small arrows the minor ones. B, in vitro U16 
processing in unfractionated oocyte nuclear extracts (ONE) and with 

20 purified XendoU (XendoU). ^^P-labelled 003 RNA was incubated for the 
times indicated below; RNA was then extracted and separated by 6% 
polyacrilamide-7M urea gel. The specific cleavage products are indicated 
aside. C, scheme of procedure employed for XendoU purification from 
ONE. D, proteins from the active fractions during the purification were 

25 separated on SDS-PAGE and visualised by Blue Coomassie staining. 
Numerals below refer to the corresponding purification steps schematised 
in panel C. The arrow points to the purified enzyme with an apparent 
molecular mass of 37 kDa. 

Figure 2 shows XendoU cleavage which requires Mn"*""*" cations. 

30 ^^P-labelled 003 RNA was incubated in oocyte nuclear extract (ONE) or 
with purified XendoU (XendoU) in the presence of different metal ions at a 
concentration of 6 mM. After 30 min, the reaction was stopped and the 
processing products analysed on 6% polyacrilamide-7M urea gel. In the 
lane - the RNA substrate was incubated with XendoU in the absence of 

35 ions. The specific cleavage products are schematised on the left side . 

Figure 3 shows that the XendoU activity is U-specific and 
produces 2-3' cyclic phosphate. A, ^^P ATP labelled synthetic 



oligoribonucleotide PI, containing the distal cleavage site upstream from 
U16, and its mutant derivatives (P2, P3) were incubated with the 
unfractionated extract (lanes 2) or with purified XendoU (lanes 3), under 
standard conditions for 30 min. RNA molecules were extracted and 
5 analysed on 10% polyacrylamide-7M urea gel. In lanes 1 untreated RNA is 
shown, in lane M RNA marker generated by alkaline digestion of P1 is 
shown; arrows indicate cleavage sites. On the right side the sequences of 
the oligoribonucleotides are reported. B, The ^^P-labelled 1-1 molecules, 
schematically represented on the left side, generated by incubation of 

10 U16-containing precursor in ONE (ONE), with purified XendoU (XendoU) 
or after injection in oocytes {in vivo), were gel purified, and their 3'end 
analysed. The molecules were incubated with 1 unit of alkaline 
phosphatase (lane 1) or with 10 mM HCI (lanes 2) or with alkaline 
phosphatase after acid treatment (lanes 3). After incubation the RNAs 

15 extracts were analysed by electrophoresis on 10 % polyacrylamide-7M 
urea gel. Untreated molecules were run as control in lanes 4. 

Figure 4 shows cDNA and amino acid sequence of XendoU. 
Nucleotides of the 5' and 3' untranslated regions are shown in small 
letters, nucleotides of the ORF in capital letters. Above each codon the 

20 corresponding amino acid is shown. The sequence portions determined by 
automated Edman degradation and mass mapping experiments (see 
"Experimental Procedures") are indicated by numbers 1, 2 and 3. The stop 
codon is identified by an asterisk. Numbers on the right side of diagram 
correspond to the amino acid residues. Underlined are the amino acid 

25 sequences identified by MALDI-mapping experiments. 

Figure 5 shows the functional analysis of in vitro translated 
XendoU A, SDS-PAGE analysis of [^^S] Methionine-labelled XendoU (lane 
2) and of control luciferase (lane 1) produced by in vitro transcription and 
translation. The arrow points to XendoU protein. B, 003 RNA was 

30 incubated, in the presence of Mn"*^^ ions, in ONE (ONE), with XendoU 
produced by reticulocyte lysate (ret-XendoU/+Mn), or with reticulocyte 
lysate as such (lane ret/+Mn). As control 003 RNA was incubated, in the 
absence of Mn*^ ions, with XendoU produced by reticulocyte lysate (lanes 
ret-XendoUAMn). The numbers below indicate incubation times: 0 min 

35 (lane 1), 45 min (lanes 2). 

Figure 6 shows that XendoU is involved in U86 snoRNA 
biosynthesis. A, U86 processing is analysed in vivo by injection of ^^P- 



labelled U86-containing precursor (P) in X. laevis oocytes (lanes in vivo), 
or in vitro by incubation of the RNA precursor in ONE (ONE), or with 
purified XendoU (XendoU). The numbers below indicate different 
incubation times: 0 min (lane 1), 10 min (lanes 2), 45 min (lanes 3), 3 
5 hours (lanes 4), 16 hours (lanes 5). RNA was then extracted and loaded 
on a 6% polyacrilamide-7M urea gel. The processing products are 
schematised aside. Arrows indicate specific XendoU cleavages. B, ^^P- 
labelled Uhindlll primer, depicted below, was reacted with unlabelled 1-4 
molecules obtained after 10 min of incubation in oocytes (lane in vivo), 45 

10 min of incubation in ONE (ONE) or 45 min of incubation with purified 
XendoU (XendoU). The products of primer extension were run in parallel 
with the sequence (lane G, A. T, and C) performed with the same 
oligonucleotide on U86. The sequence is reported on the left side: the 
arrow points to the XendoU cleavage sites. C, U86-containing precursor 

15 was incubated, in the presence of Mn*"" ions, in ONE (ONE), with XendoU 
produced by reticulocyte lysate (ret-XendoU/+Mn), or with reticulocyte 
lysate as such (lanes ret/+Mn). As control, pre-mRNA was incubated with 
XendoU produced by reticulocyte lysate in the absence of Mn^"" ions (ret- 
XendoU/-Mn\ The numbers below indicate incubation times: 0 min (lane 

20 1), 45 min (lanes 2). 

EXPERIMENTAL PROCEDURES 

Massive isolation of oocyte germinal vesicles and nuclear 
extract preparation 

X. laevis germinal vesicles were isolated following the 
25 procedure by Gandini-Attardi et al.(Gandini-Attardi et al., 1990 ) and the 
nuclear extracts were prepared as already described (Caffarelli et 
al.,1994). 

Purification of XendoU activity 

XendoU was purified from oocyte nuclear extracts (ONE). ONE 
30 was fractionated by ammonium sulphate precipitation. Solid ammonium 
sulphate (280 mg/ml) was slowly added to the nuclear extract up to 45% 
saturation and the suspension was stirred for 30' at 4*^0 and then 
centrifuged at 12.000 rpm for 30' at 4°C. The supernatant was made 70% 
saturated by a further addition of ammonium sulphate (240 mg/ml). The 
35 suspension was stirred and centrifuged as above. The resulting pellet was 
dissolved in ONE buffer (25 mM Hepes pH 7.5, 50 mM NaCI, 0.1 mM 
EDTA. 1 mM DTT, 10% glycerol) and then applied onto an hydroxyapatite 



column (CHT-II Econocolumn, Biorad). Column was washed with ONE 
buffer and then eluted with 5 column volumes of 100 mM Na-phosphate 
pH 7 in ONE buffer. The eluate was collected in 1 ml fractions next tested 
for the endonuclease activity. Selected fractions were pooled, diluted with 
5 3 volumes of ONE buffer and applied on a Blue Sepharose column (Blue 
Sepharose Fast Flow Pharmacia). The column was washed with ONE 
buffer and then eluted with 5 column volumes of 0.2 M NaCI in ONE 
buffer. The eluate was then collected in 1 ml fractions; those displaying the 
specific activity were pooled and dialysed against ONE buffer. Protein 

10 mixture was subjected to a second fractionation on hydroxyapatite column. 
The elution was performed with 10 column volumes of a linear gradient 0- 
100 mM Na-phosphate pH 7 in ONE buffer. Collected fractions were 
tested and those with activity were pooled and concentrated by means of 
ultrafiltration device (Centricon C10, Millipore). The concentrated fractions 

15 were then applied on a gel-filtration column (Pharmacia) previously 
equilibrated in ONE buffer, Elution was monitored collecting 0,5 ml 
fractions which were tested for specific activity. 

Considering the yield of purified protein it can be assumed that 
XendoU represents no more than 1/1000 of the protein mass present in 

20 nuclear extract. To obtain enough protein for sequencing and 
characterization the described procedure was carried out on several ONE 
samples of 15-20 ml collecting together the final purified fractions. 
Preparation and isolation of trvptic peptides 
Protein bands from SDS-PAGE analysis (5 |ag) stained with 

25 Coomassie Blue R250 were excised, reduced with dithiothreitol and 
carboxamidomethylated. Gel pieces were equilibrated in 25 mM NH4HCO3, 
pH 8 and finally digested in situ with trypsin at 37 °C for 18 h. Peptides 
were extracted by sonication with 100 ^\ of 25 mM NH4HC03/acetonitrile 
1:1 v/v, pH 8 (twice). Peptide mixture was fractionated by reverse-phase 

30 HPLC on a Vydac C18 column 218TP52 (250 x 1 mm), 5 |am, 300 A pore 
size (The Separation Group, USA) by using a linear gradient from 5% to 
60% of acetonitrile in 0.1% TFA over 60 min, at flow rate of 90 |aL/min. 
Individual components were manually collected and lyophilised. 

Peptide Sequencing and Mass spectrometry analysis 

35 Sequence analysis was performed using a Precise 491 protein 

sequencer (Applied Biosystems, USA) equipped with a 140C 



7 

microgradient apparatus and a 785A UV detector (Applied Biosystems, 
USA) for the automated identification of PTH-amino acids. 

Matrix assisted laser desorption ionization mass spectra were 
recorded using a Voyager DE-PRO mass spectrometer (Applied 
5 Biosystems, USA); an analytical mixture containing a-cyano-4-hydroxy- 
cinnamic acid was applied to the sample plate and allowed to be dried. 
Mass calibration was performed using the molecular ions from peptides 
produced by tryptic auto-proteolysis and the matrix as internal standards. 
In vitro RNA synthesis and oocyte microinjection 
10 U16-containing precursor (003 RNA), including the third intron 

of the L4 r-protein gene of X. laevis, was transcribed from plasmid 003 
digested with Hindlll (Caffareili et a!., 1994). U86-containing precursor was 
obtained as already described (Filippini et al., 2001). In vitro transcription 
reactions were performed in the presence of (^^P) aUTP as described 
15 (Caffareili et al.. 1998) and pre-mRNAs were injected into nuclei of stage 
VI oocytes as already described (Caffareili et al., 1994). 
In vitro processing reactions 

ONE assay: as described by (Caffareili et al., 1994). 
XendoU assay: the reaction mixture (25 contained 3X10"^ 
20 cpm of (^^P) labelled pre-mRNA, 6 mM MnCb, 50 mM NaCI, 25 mM 
Hepes pH 7.5, 1 mM DTT, 10 |ag of E. Coli tRNA, 20 U of RNAse inhibitor 
(PROMEGA) and 1 ng of purified XendoU. Reaction mixtures were 
incubated with RNA substrates at 24''C for indicated times. The products 
of the reactions were analysed on 6% polyacrylamide-7M urea gels. 
25 Substrate specificity 

The oligoribonucleotides 

PI (5'-GGAAACGUAUCCUUUGGGAG-3'). SEQ ID No 3; 
P2 (5'-GGAAACGUAUCCUUGGGAGT-3'), SEQ ID No 4; 
P3 (5'-GGAAACGUAUCCUCUGGGAG-3'), SEQ ID No 5; 
30 P4 (5'-GGAAACGUAUCCUGUGGGAG-3'), SEQ ID No 6; 

were 5' labelled: 10 pmol of each synthetic substrate were 
incubated at 37°C for 30 min, in the presence of 10 units of Polynucleotide 
Kinase (Roche), and 10 jiCi of (^^P)y-atp. The reaction was terminated at 
65°C for 5 min, primers were gel purified on 10% polyacrylamide-7M urea 
35 and incubated for 30 min in the presence of ONE or purified XendoU as 
described above. RNA was extracted and analysed on 10% 
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polyacrylamide-denaturing gel. RNA ladder was obtained by incubation of 
P4 oligo (200.000 cpm) in 500mM NaHCOs at 90^C for 20 min. 
Analysis of 3' ends of cleavage products 

^^P-labelled and gel purified 1-1 molecules (1-1 a and 1-1 b 
5 generated by cleavages at the major sites a and b upstream from U16, 
see scheme of Figure 1A), obtained in vitro after incubation of 003 RNA 
with ONE or XendoU, or in vivo after oocyte microinjection, were incubated 
in 10 ^il of 10 mM HCI at 25°C for 2 hours to open the cyclic phosphate as 
described by Forster (Forster et a!., 1990). The phosphate was then 

10 removed by incubation of the RNA in 50 mM Tris-HCI pH 8.5, 0,1 mM 
EDTA in the presence of 1 U of calf intestine Alkaline Phosphatase at 
50°C for 60 min. The enzyme was inactivated by adding 1/10 volume of 
0.2 M EGTA and the mixture heated at 65*^0 for 10 min. RNA was 
extracted with phenol/chloroform and analysed on 10% polyacrylamide-7M 

15 urea gel. 

Isolation of XendoU cDNA 

A X. laevis stage 28 embryo cDNA library, constructed in ?tZAP 
II vector, was screened using a specific probe obtained by PGR 
amplification on X. laevis cDNA with degenerated oligonucleotides (MAHs 

20 5'-ATGGCICAYGAYTAYYTIGT-3'. SEQ ID No 7 and IGTa 5'- 
ACIGGRTAIGCIGTICCIAT-3', SEQ ID No 8) designed on peptides 
obtained by tryptic digestion of purified XendoU. 

XendoU cDNA expression in reticulocyte lysate 

XendoU Open Reading Frame (ORF) was cloned into Blue 

25 Script vector and ^^(S)Methionine-labelled protein was produced by in vitro 
transcription and translation using the TnT-coupled Reticulocyte Lysate 
System Kit (PROMEGA) according to the manufacturer's instructions. 
Translational products were analysed on 10% SDS-PAGE. 
Primer extension analysis 

30 In vitro transcribed U86-containing precursor was obtained from 

a standard T7 reaction, but in the presence of 500 |liM unlabelled UTP. 
The transcript was injected into X. laevis oocytes or incubated In ONE or 
with purified XendoU. The processing product 1-4 RNA was gel-purified 
and reverse transcribed (SS pre-amplification system - GIBCO) with the 5' 

35 terminally labelled oligonucleotide UHindlll (5- 

AAGCTTCTTCATGGCGGCTCGGCCAAT-3', SEQ ID No 9) 
complementary to 19 nucleotides at the 3' end of the downstream exon. 



The elongated products were run in parallel with the sequence obtained 
with the same primer on U86-containing precursor. 
RESULTS 

Purification of XendoU from X. laevis oocyte nuclear extracts 
5 The authors previously developed an in vitro system able to 

reproduce the release of U16 snoRNA from its host intron (Caffarelli et a!., 
1994). When ^^P-labelled U16-containing precursor is incubated in X. 
laevis oocyte nuclear extract, in the presence of Mn*"*^ ions, specific 
products, originating from endonucleolityc cleavages, are found (Figure 1A 

10 and IB): the 1-2 and 1-1 molecules derive from cleavage upstream from 
U16 coding region, while 1-3 and 1-4 molecules are produced by cleavage 
downstream from U16. When double cleavage occurs on the same pre- 
mRNA molecule, pre-U16 products accumulate, which are then converted 
to the mature snoRNA by exonucleotidic digestion 

15 The procedure of biochemical characterization is indicated in 

Figure 1C. while the protein content of the fractions displaying XendoU 
activity is shown in Figure 1D. After several chromatographic steps, a 
single protein of 37 kDa molecular mass was obtained as showed in 
Figure ID, lane 6. The elution of XendoU activity during the last 

20 purification step on gel filtration column is consistent with a monomeric 
protein of 37 kDa. The assay for testing the presence of XendoU activity 
was performed by incubating ^^P-labelled U16-containing precursor with 
aliquots of the different fractionation steps. Figure IB shows the 
comparison of XendoU activity of unfractionated nuclear extracts (ONE) 

25 with that of the gel filtration column (XendoU). Since previously it was 
demonstrated the dependence of XendoU activity on Mn"^"*" ions, this 
cofactor is always present in the reaction: in both cases the same primary 
cleavage products, 1-2 and 1-3 and their complementary molecules 1-1 and 
1-4, are generated. Primer extension analysis performed on cleavage 

30 products 1-2 and 1-4 revealed that the purified enzyme cleaves intronic 
sequences at the same U-rich in vivo recognized areas (Caffarelli et 
al,.1994). 

Characterization of XendoU cleavage 

The analysis of the ion dependence of cleavage was carried out 
35 by incubating '^^P-labelled U16 containing precursor with ONE or with 
purified XendoU in the presence of different cations as shown in Figure 2. 
In both cases Cd^^, Zn*^, Ni""*, Co^^ and Pb^^ do not activate cleavage; 



10 

whereas Mn""^, and to a minor extent Mg""^, produce the appearance of the 
specific cleavage products. These results indicate a Mn^^ ion requirement 
for XendoU full activity. 

The substrate selectivity of XendoU was further addressed by 
5 incubating the purified enzyme with synthetic oligoribonucleotide (P1), 
containing the distal XendoU cleavage site (site d, Figure 1A), localised 
upstream from U16, and with mutated derivatives thereof (P2 and P3). 
The obtained results, shown in Figure 3A, indicate that XendoU displays 
the same selectivity observed in vivo and that the minimal consensus 

10 cleavage site is an uracyl dimer. 

The incubation of the enzyme with double stranded 
oligoribonucleotides, of 21 nucleotides, containing U-rich sequences 
demonstrated that only single stranded RNAs are recognized by XendoU. 

The chemistry of XendoU cleavage was then assessed by 

15 determining the chemical nature of the ends of the cleaved products. To 
this aim we analysed the ends of ^^P-labelled Ma and 1-1 b molecules 
produced with ONE or with the purified XendoU: these molecules were gel 
purified and treated with alkaline phosphatase, HCI, or both. Figure 3B 
shows that a slight decrease in migration, due to the loss of a negative 

20 charge, is obtained when the alkaline phosphatase treatment follows the 
HCI treatment (lanes 3). This result is obtained both with ONE (lanes 
ONE) and purified XendoU (lanes XendoU) and indicates that the 3' end of 
the molecules bears a phosphate group which is present in a 2'-3' cyclic 
form (Lund and Dahiberg, 1992; Forster et al., 1990). In fact, only after the 

25 acidic treatment the phosphate group can be removed by phosphatase 
resulting in a slight electrophoresis delay. This effect is observed both for 
the 1-1 a and 1-1 b molecules obtained with unfractionated oocyte nuclear 
extract and with XendoU. As previously reported (Caffarelli et al..1996), 
the products of primary cleavage such as 1-1 molecules, are in vivo 

30 unstable being rapidly degraded, after cleavage. Nevertheless, after very 
short incubation times, little amounts of Mb molecules can be purified and 
subjected to the same treatment as described above. Figure 3B (lanes in 
vivo) indicates that also in this case a slight delay in migration is obtained 
(lane 3), demonstrating that the products of the in vivo reaction display 2 - 

35 3' cyclic phosphates as well. 
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Isolation of cDNA for XendoU 

After purification, protein samples from SDS-PAGE were 
reduced, alkylated and digested with trypsin as reported in the 
experimental section. The resulting peptide mixture was resolved by R- 
5 HPLC and selected peptide fractions were submitted to automated Edman 
degradation. The three sequence portions determined are reported in 
Figure 4 (indicated as 1, 2 and 3). From these amino acid sequences, 
degenerated oligonucleotides were derived and employed in different 
combinations and different orientation in PGR amplification reactions on 

10 cDNA from polyA* RNA extracted from X. laevis oocytes. Only the reaction 
performed with sequence 1 (forward) and sequence 3 (reverse) resulted in 
an amplification product (500 bp). Sequencing of this product indicated the 
presence of an Open Reading Frame containing peptide 2. This cDNA 
probe was then utilised for the screening of a X. laevis stage 28 embryo 

15 cDNA library, allowing the isolation of a full-length cDNA (figure 4). The 
amino acid sequence determined was confirmed by MALDI-MS spectra of 
the tryptic peptides. In fact, signals observed at m/z 565.29, 814.45, 
1004.48, 10025.54, 1132.59, 1190.60, 1490.78, 1504.80, 1520.70, 
1729.91, 1758.82. 1988.08, 2000.00, 2014.01, 2076.99, 2162.98, 

20 2234.14, 2238.05, 2394.15. 2432.26, 3058.51 and 3370.66 were ascribed 
to peptides 196-200, 126-132, 117-124, 6-14, 117-125, 41-52, 114-125, 
260-271, 15-26, 137-149, 53-67, 256-271, 53-69, 275-292, 204-220, 150- 
169. 133-149, 171-188, 170-188, 117-136, 6-31 and Ac4-31. This result 
allowed to cover 65% of the entire sequence, explaining the reluctance of 

25 a blotted protein sample to Edman degradation. 

Cloning and expression of XendoU cDNA 
XendoU ORF. 876 bp, was cloned into Blue Script vector and 
the protein was produced by in vitro transcription and translation using 
reticulocyte lysate. The translational product was analysed on SDS-PAGE 

30 revealing a protein of 37 kDa molecular mass (Figure 5 A). In order to 
assess the nature of the 37 kDa protein, enzymatic activity was assayed 
by incubating the ^^P-labelled RNA substrate in reticulocyte lysate 
expressing XendoU ORF. The activity assay was carried out in parallel 
with the unfractionated extract: Figure 5B shows that the cleavage 

35 products generated by the translated 37 kDa protein (lane ret- 
XendoU/+Mn) exactly match those obtained with the extract (lanes ONE). 
Furthermore, the lack of cleavage when Mn'"^ ions are not added to the 
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reaction mixture (lane ret-XendoU/-Mn) confirms the specific ion 
requirement for XendoU and suggests that the binding to the cofactor is 
reversible. As negative control the activity assay was carried out by 
incubating RNA substrate in reticulocyte lysate as such, in the presence of 
5 Mn"""" ions (lane ret/+Mn). 

XendoU also participates to U86 snoRNA biosynthesis 
The authors previously identified a snoRNA, named U86, 
encoded by an intron of Nop56 gene of X. laevis. It was also shown that 
U86 has homologs both in human, where it displays the same genomic 

10 organization, and in yeast where it is embedded in the ORF of Ribi gene 
(Filippini et al., 2001). As U16 snoRNA, also U86 is contained in a very 
poorly spliceable intron and its biosynthesis appears to be alternative to 
that of the co-transcribed mRNA. The injection of ^^P-labelled U86- 
containing precursor into X. laevis oocytes generates the truncated 

15 products 1-2 and 1-3 and their 5' and 3* complementary molecules, 1-1 and 
1-4, by means of upstream and downstream cleavages from U86 coding 
region (Figure 6A, lanes in vivo). 

Processing of U86-containing precursor with purified XendoU 
(Figure 6A, lanes XendoU) or with the reticulocyte lysate expressing 

20 XendoU ORF (Figure 6C, lane ret-XendoU/+ Mn) demonstrates that the 
enzyme is responsible for the occurring cleavage downstream from U86 
coding region. The effector of the cleavage upstream from U86, that 
produces 1-2 and 1-1 molecules, is not yet known and it is lost in oocyte 
nuclear extracts (Figure 6A, lanes ONE). The XendoU cleavage sites, 

25 downstream from U86, were determined by primer extension on 1-4 
molecules and correspond to two U-rich sequences (Figure 6B). 
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CLAIMS 

1. Nucleic acid encoding for a protein with endoribonucleasic 
activity wherein said protein with endoribonucleasic activity is 
characterized in that it is polyU specific, Mn^"*^ ions dependent and able to 
release 2-3' cyclic phosphate and 5'OH RNA ends cleavage products. 

2. Nucleic acid according to claim 1 substantially including 
nucleotide sequence of figure 4, functional homologs thereof or a 
complennentary sequence thereto. 

3. Recombinant vector able to express effectively the inventive 
nucleic acid in prokaryotes according to claims 1 or 2. 

4. Recombinant vector able to express effectively the inventive 
nucleic acid in eukaryotes according to claims 1 or 2. 

5. Protein with endoribonucleasic activity characterized in that it 
is polyU specific, Mn^"^ ions dependent, and able to release 2-3' cyclic 
phosphate and 5'OH RNA ends cleavage products or functional portions 
thereof. 

6. Protein according to claim 5 encoded by nucleic acid 
according to claims 1 or 2, 

7. Protein according to claim 6 having substantially the amino 
acid sequence of figure 4. 
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PURIFICATION. CLONING AND BIOCHEMICAL CHARACTERIZATION 
OF XendoU. ENDORIBONUCLEASIC ACTIVITY INVOLVED IN SMALL 
NUCLEAR RNA SPLICING-INDEPENDENT BIOSYNTHESIS IN 
XENOPUS LAEVIS 

ABSTRACT 

Cloning and characterization of the gene for Mn** dependent 
RNA-se endoribonucleasic activity able to generate 2-3' cyclic phosphate 
and 5'OH ends. 
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<210> 1 
<211> 1265 
<212> DNA 

<213> Xenopus laevis 

<220> 

<221> CDS 

<222> (39) . . (914) 

<400> 1 

attggggaac tgggagcaga gagtgacggg caggagcc atg gcg agt aac agg ggg 56 

Met Ala Ser Asn Arg Gly 
1 5 

cag ctg aac cat gaa etc tec aag ctg ttt aat gag ctg tgg gac gca 104 
Gin Leu Asn His Glu Leu Ser Lys Leu Phe Asn Glu Leu Trp Asp Ala 
10 15 20 

gat cag aac egg atg aag tec ggg aag gat tat egg ate tee ttg cag 152 
Asp Gin Asn Arg Met Lys Ser Gly Lys Asp Tyr Arg lie Ser Leu Gin 
25 30 35 

ggt aaa gca ggg tac gta ccc gcc ggt tec aac cag gee agg gac age 200 
Gly Lys Ala Gly Tyr Val Pro Ala Gly Ser Asn Gin Ala Arg Asp Ser 
40 45 50 

gcc teg tte ecg etc ttc cag ttc gtc gat gag gag aag ctg aag age 248 
Ala Ser Phe Pro Leu Phe Gin Phe Val Asp Glu Glu Lys Leu Lys Ser 
55 60 65 70 



agg aag acg ttt gca acc ttc att tec ctg ctg gac aat tat gag atg 296 
Arg Lys Thr Phe Ala Thr Phe lie Ser Leu Leu Asp Asn Tyr Glu Met 
75 80 85 

gac acg ggg gtg gcc gag gtt gtg act ccg gag gaa ate get gaa aac 344 
Asp Thr Gly Val Ala Glu Val Val Thr Pro Glu Glu He Ala Glu Asn 
90 95 100 

aac aac ttc ctg gac gcc att ctg gaa acc aaa gtg atg aag atg gca 392 
Asn Asn Phe Leu Asp Ala He Leu Glu Thr Lys Val Met Lys Met Ala 
105 110 115 

cat gac tac ctg gtg agg aag aac caa gcc aaa ccc acc egg aat gac 440 
His Asp Tyr Leu Val Arg Lys Asn Gin Ala Lys Pro Thr Arg Asn Asp 
120 125 130 

ttc aag gtc caa ctg tac aac ate tgg tte cag ctg tac tea egg gcc 488 
Phe Lys Val Gin Leu Tyr Asn He Trp Phe Gin Leu Tyr Ser Arg Ala 
135 140 145 150 

cca ggg age aga ccc gat teg tgc ggc ttt gag cac gtg ttt gtg gga 536 
Pro Gly Ser Arg Pro Asp Ser Cys Gly Phe Glu His Val Phe Val Gly 

155 160 165 

gaa teg aag ega ggg cag gag atg atg ggg ett cac aac tgg gtc cag 584 
Glu Ser Lys Arg Gly Gin Glu Met Met Gly Leu His Asn Trp Val Gin 
170 175 180 

ttt tac ctt cag gag aag agg aag aac ate gac tat aaa gga tac gtg 632 
Phe Tyr Leu Gin Glu Lys Arg Lys Asn He Asp Tyr Lys Gly Tyr Val 
185 190 195 

get egg cag aac aag agt egg ccg gat gaa gat gat cag gtg ttg aac 680 
Ala Arg Gin Asn Lys Ser Arg Pro Asp Glu Asp Asp Gin Val Leu Asn 
200 205 210 

ctg cag tte aat tgg aag gag atg gtg aaa ccc gtc ggc age age tte 728 
Leu Gin Phe Asn Trp Lys Glu Met Val Lys Pro Val Gly Ser Ser Phe 
215 220 225 230 

att ggc gtc age ccg gaa ttc gaa ttc gee ett tac ace ate gtc tte 776 
He Gly Val Ser Pro Glu Phe Glu Phe Ala Leu Tyr Thr He Val Phe 
235 240 245 

etc gcg tet cag gag aag atg age ega gaa gtc gtt egg ctg gaa gaa 824 
Leu Ala Ser Gin Glu Lys Met Ser Arg Glu Val Val Arg Leu Glu Glu 
250 255 260 

tac gaa ctg cag ate gtc gtc aat cgc cac ggc egt tat ata ggg ace 872 
Tyr Glu Leu Gin He Val Val Asn Arg His Gly Arg Tyr He Gly Thr 
265 270 275 

gee tac ccc gtc etc ctg age ace aat aac ccg gat ctg tac 914 
Ala Tyr Pro Val Leu Leu Ser Thr Asn Asn Pro Asp Leu Tyr 
280 285 290 



tgagggggeg gggetagaga teaeageegg tteeeacggt ttgggtgeat ttaetaaeaa 974 



aactgcacca atgcaacaat gcaagcagat aatgggggca ggtccatatc cctctgcttt 1034 
ccctagcgtg tgtggggcac attaacccta taactgtcac tcactgcacc agacccatta 1094 
tttaacccca caagggacat caagccagtg ccttgttatg agagagcgca gccggggctt 1154 
ctctactgtg aaacttctgt attgtataga gtttacttgg tttcttcctc cagacaattt 1214 
cacttttttt ttgctttgcc tttaaccatt aaaagtccat gacatttctg t 1265 



<210> 2 
<211> 292 
<212> PRT 

<213> Xenopus laevis 
<400> 2 

Met Ala Ser Asn Arg Gly Gin Leu Asn His Glu Leu Ser Lys Leu Phe 
15 10 15 

Asn Glu Leu Trp Asp Ala Asp Gin Asn Arg Met Lys Ser Gly Lys Asp 
20 25 30 

Tyr Arg lie Ser Leu Gin Gly Lys Ala Gly Tyr Val Pro Ala Gly Ser 

35 40 45 . 

Asn Gin Ala Arg Asp Ser Ala Ser Phe Pro Leu Phe Gin Phe Val Asp 
50 55 60 

Glu Glu Lys Leu Lys Ser Arg Lys Thr Phe Ala Thr Phe lie Ser Leu 
65 70 75 80 

Leu Asp Asn Tyr Glu Met Asp Thr Gly Val Ala Glu Val Val Thr Pro 

85 90 95 

Glu Glu lie Ala Glu Asn Asn Asn Phe Leu Asp Ala lie Leu Glu Thr 
100 105 110 

Lys Val Met Lys Met Ala His Asp Tyr Leu Val Arg Lys Asn Gin Ala 
115 120 125 

Lys Pro Thr Arg Asn Asp Phe Lys Val Gin Leu Tyr Asn lie Trp Phe 
130 135 140 

Gin Leu Tyr Ser Arg Ala Pro Gly Ser Arg Pro Asp Ser Cys Gly Phe 
145 150 155 160 

Glu His Val Phe Val Gly Glu Ser Lys Arg Gly Gin Glu Met Met Gly 
165 170 175 

Leu His Asn Trp Val Gin Phe Tyr Leu Gin Glu Lys Arg Lys Asn lie 
180 185 190 

Asp Tyr Lys Gly Tyr Val Ala Arg Gin Asn Lys Ser Arg Pro Asp Glu 
195 200 205 

Asp Asp Gin Val Leu Asn Leu Gin Phe Asn Trp Lys Glu Met Val Lys 
210 215 220 



4 



Pro Val Gly Ser Ser Phe lie Gly Val Ser Pro Glu Phe Glu Phe Ala 
225 230 235 240 

Leu Tyr Thr lie Val Phe Leu Ala Ser Gin Glu Lys Met Ser Arg Glu 
245 250 255 

Val Val Arg Leu Glu Glu Tyr Glu Leu Gin lie Val Val Asn Arg His 
260 265 270 

Gly Arg Tyr lie Gly Thr Ala Tyr Pro Val Leu Leu Ser Thr Asn Asn 
275 280 285 

Pro Asp Leu Tyr 
290 



<210> 3 
<211> 20 
<212> RNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 3 

ggaaacguau ccuuugggag 20 



<210> 4 
<211> 20 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 4 

ggaaacguau ccuugggagg 20 



<210> 5 
<211> 20 
<212> RNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 5 

ggaaacguau ccucugggag 



20 



<211> 20 
<212> RNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<400> 6 

ggaaacguau ccugugggag 



<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (6) 
<223> inosine 

<220> 

<221> modif ied_base 
<222> (18) 
<223> inosine 

<400> 7 

atggcncayg aytayytngt 



<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 

<220> 

<221> modif ied_base 
<222> (3) 
<223> inosine 

<220> 

<221> modif ied_base 
<222> (9) 
<223> inosine 



6 



<220> 
<221> 
<222> 
<223> 



modi f ied 
(12) 

inosine 



base 



« 



<220> 

<221> modif ied_base 
<222> (15) 
<223> inosine 

<220> 

<221> modif ied^base 
<222> (18) 
<223> inosine 

<400> 8 

acnggrtang cngtnccnat 20 



<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<210> 10 
<211> 15 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Synthetic 
oligonucleotide 



<400> 9 

aagcttcttc atggcggctc ggccaat 



27 



<400> 10 

tcttttcatt cattt 
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